A Cost-based Optimizer for SPARQL Queries
نویسندگان
چکیده
The cost of answering a query against an ontology is affected by at least three elements: the size of the ontology, the strategy followed to combine the data, and the order or plan in which data is processed. In the context of the Semantic Web, very large ontologies have been defined; therefore, techniques to identify efficient evaluation strategies are needed. We propose cost-based optimization techniques for SPARQL queries. In our approach, ontologies are modeled as a deductive database. The extensional database is comprised of meta-level predicates that represent the information explicitly modeled by the ontology; for each RDFS built-in vocabulary term, we define a meta-level predicate (e.g., subClassOf). The intensional database corresponds to the deductive rules that implement the semantics of the vocabulary terms (e.g., the transitive properties of the subClassOf term). Currently, we have developed the following techniques:
منابع مشابه
A Cost Model for Querying Distributed RDF-Repositories with SPARQL
In the last years, the query language SPARQL has evolved into the widely accepted standard for querying RDF. Since many Semantic Web applications make use of data whose storage and management is distributed, distributed SPARQL query processing becomes necessary. In the relation and object-oriented database community the efficiency gain by cost-based, adaptive optimizers for distributed querying...
متن کاملWAVEGUIDE: Evaluating SPARQL Property Path Queries
The extension of SPARQL 1.1 of property paths now o↵ers a type of regular path query for RDF graph databases. While eminently useful, these queries are di cult to optimize to evaluate e ciently. We have embarked on a project we call Waveguide to build a cost-based optimizer for SPARQL queries with property paths. Waveguidemaps the property path to a waveguide plan (WGP) composed of wavefront au...
متن کاملSPARTex: A Vertex-Centric Framework for RDF Data Analytics
A growing number of applications require combining SPARQL queries with generic graph search on RDF data. However, the lack of procedural capabilities in SPARQL makes it inappropriate for graph analytics. Moreover, RDF engines focus on SPARQL query evaluation whereas graph management frameworks perform only generic graph computations. In this work, we bridge the gap by introducing SPARTex, an RD...
متن کاملTowards Query Optimization for SPARQL Property Paths
The extension of SPARQL in version 1.1 with property paths offers a type of regular path query for RDF graph databases. Such queries are difficult to optimize and evaluate efficiently, however. We have embarked on a project, Waveguide, to build a cost-based optimizer for SPARQL queries with property paths. Waveguide builds a query plan—a waveguide plan (WGP)—which guides the query evaluation. T...
متن کاملRDF-3X: a RISC-style engine for RDF
RDF is a data representation format for schema-free structured information that is gaining momentum in the context of Semantic-Web corpora, life sciences, and also Web 2.0 platforms. The “pay-as-you-go” nature of RDF and the flexible pattern-matching capabilities of its query language SPARQL entail efficiency and scalability challenges for complex queries including long join paths. This paper p...
متن کامل